Handling Noisy Queries in Cross Language FAQ Retrieval
نویسندگان
چکیده
Recent times have seen a tremendous growth in mobile based data services that allow people to use Short Message Service (SMS) to access these data services. In a multilingual society it is essential that data services that were developed for a specific language be made accessible through other local languages also. In this paper, we present a service that allows a user to query a FrequentlyAsked-Questions (FAQ) database built in a local language (Hindi) using Noisy SMS English queries. The inherent noise in the SMS queries, along with the language mismatch makes this a challenging problem. We handle these two problems by formulating the query similarity over FAQ questions as a combinatorial search problem where the search space consists of combinations of dictionary variations of the noisy query and its top-N translations. We demonstrate the effectiveness of our approach on a real-life dataset.
منابع مشابه
Improving Accuracy of SMS based FAQ Retrieval
Improving Accuracy of SMS based FAQ Retrieval Aparna Joshi Department of Information Technology Army Institute of Technology, Pune University of Pune Maharashtra, India _______________________________________________________________________________________ Abstract: Short messaging service (SMS) is a convenient technology for a user to get the needed information delivered on demand. The automat...
متن کاملDCU@FIRE 2011: SMS-based FAQ Retrieval
This paper gives an overview of DCU’s participation in the SMS-based FAQ Retrieval task at FIRE 2011. DCU submitted three runs for monolingual English experiments. The approach consisted of first transforming the noisy SMS queries into a normalised, corrected form. The normalised queries were then used to retrieve a ranked list of FAQ results by combining the results from three slightly differe...
متن کاملSMS based FAQ Retrieval using Theme Matching Scheme
As a participant of FIRE 2012 monolingual English SMS based FAQ Retrieval Task, we proposed a theme matching scheme [1]. Once again the scheme is implemented for the same task in FIRE 2013 having different set of SMS and FAQ queries. An SMS text usually consists of certain noisy terms due to the limitations of characters allowed in an SMS, lack of screen space and unintended typographical error...
متن کاملAutomated FAQ Answering: Continued Experience with Shallow Language Understanding
The subject of this research is development of an evolving automated FAQ (Frequently Asked Question) answering system that provides pre-stored answers to user questions asked in ordinary English. The natural language processing technique developed for FAQ retrieval does not analyze user queries; instead, analysis is applied to FAQs in the database long before any user queries are submitted. Thu...
متن کاملDetecting Missing Content Queries in an SMS-Based HIV/AIDS FAQ Retrieval System
Automated Frequently Asked Question (FAQ) answering systems use pre-stored sets of question-answer pairs as an information source to answer natural language questions posed by the users. The main problem with this kind of information source is that there is no guarantee that there will be a relevant question-answer pair for all user queries. In this paper, we propose to deploy a binary classifi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010